Durability of replicated distributed storage systems
نویسندگان
چکیده
منابع مشابه
Analysis of the Durability of Replicated Distributed Storage Systems
In this paper, we investigate the roles of replication vs. repair to achieve durability in large-scale distributed storage systems. Specifically, we address the fundamental questions: How does the lifetime of an object depend on the degree of replication and rate of repair, and how is lifetime maximized when there is a constraint on resources? In addition, in real systems, when a node becomes u...
متن کاملHybrid Regenerating Codes for Distributed Storage Systems
Distributed storage systems are mainly justified due to their ability to store data reliably over some unreliable nodes such that the system can have long term durability. Recently, regenerating codes are proposed to make a balance between the repair bandwidth and the storage capacity per node. This is achieved through using the notion of network coding approach. In this paper, a new variation ...
متن کاملA Distributed and Replicated Service for Checkpoint Storage
As High Performance platforms (Clusters, Grids, etc.) continue to grow in size, the average time between failures decreases to a critical level. An efficient and reliable fault tolerance protocol plays a key role in High Performance Computing. Rollback recovery is the most common fault tolerance technique used in High Performance Computing and especially in MPI applications. This technique reli...
متن کاملMinimizing Access Costs in Replicated Distributed Systems
Physical communication links are all owned by some organization or another. The owning body may choose to let transactions utilize their resources freely without charge or they may assign a cost for the usage. Typically the cost of using a communication link is either based upon a flat-fee per transaction, the number of messages transmitted over the link for the transaction, the submission time...
متن کاملSite Recovery in Replicated Distributed Database Systems
A solution to the problem of integrating a recovering site inLo a distributed database system is presented. The basic idea used for the correct recovery is to maintain a consistent view of the status (up or down) of all sites. This view need not be the exact current status of the sites. but is the status as perceived by other sites. The session number is used to represent the actual state of a ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM SIGMETRICS Performance Evaluation Review
سال: 2008
ISSN: 0163-5999
DOI: 10.1145/1384529.1375514